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pH ■ Abstract 

Symmetrical Multilevel Diversity Coding (SMDC) is a network compression problem 
■ introduced by Roche (1992) and Yeung (1995). In this setting, a simple separate coding 

strategy known as superposition coding was shown to be optimal in terms of achieving 
\ the minimum sum rate (Roche, Yeung, and Hau, 1997) and the entire admissible rate 

region (Yeung and Zhang, 1999) of the problem. This paper considers a natural general- 
ization of SMDC to the secure communication setting with an additional eavesdropper. 
It is required that all sources need to be kept perfectly secret from the eavesdropper as 
long as the number of encoder outputs available at the eavesdropper is no more than a 
given threshold. First, the problem of encoding individual sources is studied. A precise 
characterization of the entire admissible rate region is established via a connection to 
the problem of secure coding over a three-layer wiretap network and utilizing some basic 
polyhedral structure of the admissible rate region. Building on this result, it is then 
shown that superposition coding remains optimal in terms of achieving the minimum 
' sum rate for the general secure SMDC problem. 



1 Introduction 

Symmetrical Multilevel Diversity Coding (SMDC) is a source coding problem with L indepen- 
dent discrete memoryless sources (Si, . . . , Sl), where the importance of the sources is assumed 
to decrease with the subscript I. The sources are to be encoded by a total of L encoders, where 
the rate of the /th encoder output is Ri. The decoder can access a subset U C Q L : = {1, . . . , L} 
of the encoder outputs. Which subset of the encoder outputs is available at the decoder is 
unknown a priori at the encoders. However, no matter which subset U actually realizes, the 
sources (Si, . . . , S m ) need to be asymptotically perfectly reconstructed at the decoder when- 
ever \U\ > m. Note that the word "symmetrical" here refers to the fact that the sources that 
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need to be reconstructed at the decoder depend on the available subset of the encoder outputs 
only via its cardinality. The rate allocations at different encoders, however, can be different 
and are not necessarily symmetrical. 

The problem of Multilevel Diversity Coding (MDC) was introduced by Roche [1] and Yeung 
[2] in the early 1990s. In particular, [2] considered the simple coding strategy of separately 
encoding different sources at the encoders, subsequently referred to as superposition coding. 
The aforementioned SMDC problem was first systematically studied in [3], where it was shown 
that superposition coding can achieve the minimum sum rate for the general SMDC problem 
(with an arbitrary total number of encoders L) and the entire admissible rate region with 
L = 3 encoders. The problem regarding whether superposition coding can achieve the entire 
admissible rate region for the general SMDC problem, however, remained open. Finally, in a 
very elegant (albeit highly technical) paper [I], Yeung and Zhang resolved the open problem 
by positive through the so-called a-resolution method. 

Recent years have seen a flurry of research on information-theoretic security. See [5] and [Hj 
for surveys of recent progress in this field. Motivated by this renewed interest, in this paper 
we consider the problem of Secure Symmetrical Multilevel Diversity Coding (S-SMDC) in the 
presence of an additional eavesdropper. Specifically, a collection of L — N independent discrete 
memoryless sources (S\, . . . , Sl-n) are to be encoded by a total of L encoders, where the 
rate of the /th encoder output is Ri. A legitimate receiver and an eavesdropper can access 
a subset U C Q L and A C Q L of the encoder outputs, respectively. Which subsets of the 
encoder outputs are available at the legitimate receiver and the eavesdropper are unknown 
a priori at the encoders. However, no matter which subsets U and A actually occur, the 
sources (Si, . . . , S^) need to be asymptotically perfectly reconstructed at the legitimate receiver 
whenever \U\ > N + k, and the entire collection of the sources (Si, . . . , Sl-jv) needs to be kept 
perfectly secret from the eavesdropper as long as \A\ < N. As before, the word "symmetrical" 
here refers to the access structure at the legitimate receiver and the eavesdropper, but not to 
the rate allocations at different encoders. We envision that such a communication scenario 
is useful for designing distributed information storage systems [1] where information retrieval 
needs to be both robust and secure. 

As mentioned previously, separate encoding of different sources (superposition coding) can 
achieve the entire admissible rate region for the general SMDC problem without any secrecy 
constraints [I]. It is thus natural to ask whether the same separate encoding strategy would 
remain optimal for the general S-SMDC problems. For the classical SMDC problems without 
any secrecy constraints, the problem of efficient encoding of individual sources is essentially 
to transmit the source over an erasure channel and is well understood based on the earlier 
work of Singleton [7]. For the S-SMDC problems, however, the problem of efficient encoding 
of individual sources is closely related to the problem of secure coding over a Wiretap Network 
(WN) [8|, which, in its most general setting, is a very challenging problem in information- 
theoretic security. 

The rest of the paper is organized as follows. In Sec. [2l we focus on the problem of 
encoding individual sources, i.e., Secure Symmetrical Single-level Diversity Coding (S-SSDC). 
By leveraging the results of [8] on secure coding over a three-layer WN and utilizing some 
basic polyhedral structure of the admissible rate region, we provide a precise characterization 
of the entire admissible rate region for the general S-SSDC problem. Building on this result, 
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in Sec. |3]we show that superposition coding can achieve the minimum sum rate for the general 
S-SMDC problem. Finally, in Sec. H] we conclude the paper with some remarks. 

2 Secure Symmetrical Single-Level Diversity Coding 
2.1 Problem Statement 

Let be a discrete memoryless source with time index t and let S n := (5[1], . . . , S[n]). 

An (L, N, m) S-SSDC problem consists of a set of L encoders, a legitimate receiver who has 
access to a subset U C Q L of the encoder outputs, and an eavesdropper who has access to a 
subset A C Q L of the encoder outputs. Which subsets of the encoder outputs are available at 
the legitimate receiver and the eavesdropper are unknown a priori at the encoders. However, 
no matter which subsets U and A actually occur, the legitimate receiver must be able to 
asymptotically perfectly reconstruct the source whenever \U\ > m, and the source must be 
kept perfectly secret from the eavesdropper as long as \A\ < N. Obviously, reliable and secure 
communication of the source is possible only when m > N. 

Formally, an (n, (Mi, . . . , Ml)) code is defined by a collection of L encoding functions 

e, :S n x /C-> {1,...,M}, V/ = 1,...,L (1) 

and decoding functions 

d v : JJ{1,...,M Z } ^S n , VU C Q L s.t. \U\ > m. (2) 

Here, /C denotes the key space accessible to all L encoders. There are no limitations on the 
size of the key space /C. However, the secret key is only shared by the encoders, but not 
with the legitimate receiver or the eavesdropper. A nonnegative rate tuple (Ri,...,Rl) is 
said to be admissible if for every e > 0, there exits, for sufficiently large block length n, an 
(n, (Mi, . . . , Ml)) code such that: 

• (Rate constraints) 

-logM<^ + e, V/ = 1,...,L; (3) 

n 

• (Asymptotically perfect reconstruction at the legitimate receiver) 

Yi{du(Xu) S n } < e, W C VL L s.t. |C/| > m (4) 

where X; := ei(S n , K) is the output of the /th encoder, K is the secret key shared by all 
L encoders, and Xjj := {X t : I G [/}; and 

• (Perfect secrecy at the eavesdropper) 

H(S n \X A ) = H{S n ), VA C Q L s.t. \A\ < N (5) 

i.e., observing the encoder outputs Xa does not provide any information regarding to 
the source sequence S n . 
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The admissible rate region 1Z is the collection of all admissible rate tuples (Ri, . . . ,Rl)- 
The minimum sum rate R ms is defined as 



R ms ■= min R t . (6) 

{Rx,...,R L )en^ 



2.2 Main Results 

The following lemma provides a simple outer bound on the admissible rate region of the general 
S-SSDC problem. Let TZ(L, k, H) be the collection of all nonnegative rate tuples (R\, . . . , Rl) 
satisfying 

Y,Ri>h, v^enf (7) 

(k) 

where £l L is the collection of all subsets of of size k. 

Lemma 1. For any (L,N,m) S-SSDC problem, the admissible rate region 

KC1l(L,m-N,H(S)). (8) 

Lemma [1] can be proved using standard information-theoretic techniques. For completeness, 
a proof is included in Appendix |A] The above outer bound is known to be tight in the following 
two special cases: 

1) When N = 0, the (L,N,m) S-SSDC problem reduces to the classical (L,m) SSDC 
problem without any secrecy constraints, for which the admissible rate region is known [7] 
to be n(L,m,H(S)). 

2) With N > but m — N + 1, a collection D of the encoder outputs will either lead to an 
asymptotically perfect reconstruction of the source (whenever \D\ > N + 1), or provide 
zero information on the source (whenever \D\ < N). In this case, the (L,N,m) S-SSDC 
problem reduces to the classical (L, N) threshold secret sharing problem, for which the 
admissible rate region is known PdO] to be 1Z(L, 1, H(S)). 

The main result of this section is that the outer bound TZ(L,m — N,H(S)) is in fact 
the admissible rate region for the general S-SSDC problem, as summarized in the following 
theorem. 

Theorem 1. For any (L,N,m) S-SSDC problem, the admissible rate region 

K = K(L,m- N,H(S)). (9) 

A proof of the theorem is provided in Sec. 12.31 To show that every rate tuple in TZ(L, m — 
N, H(S)) is admissible, our proof proceeds in the following two steps. First, we show that for 
any (L, N, m) S-SSDC problem, the symmetrical rate tuple (H(S)/ (m—N), . . . , H(S)/(m—N)) 
is admissible. In our proof, this is accomplished by relating the S-SSDC problem to the problem 
of secure coding over a three-layer WN and using the result of [HI Th. 3] on an achievable secrecy 
rate for the generic WN. Building on the previous result, next we show that every rate tuple 
in TZ(L, m — N, H(S)) is admissible via an induction argument (inducting on the total number 
of encoders L) and the following polyhedral structure of TZ(L, k, H). 



4 



Proposition 1. 1Z(L, k, H) is a pointed polyhedron in M L with the following structural prop- 
erties: 

1) The characteristic cone oflZ(L, k, H) is given by {(Ri, ■ ■ ■ , Rl) '■ Ri > 0, V/ = 1, . . . , L}. 

2) Among all corner points (vertices) of 1Z(L, k, H) , (H/k, . . . ,H/k) is the only one with 
all strictly positive entries (if there exists any). 

3) For any I = 1, . . . , L, the Ri = slice ofK.iL, k, H) is isomorphic to 1Z(L — 1, k — 1, H). 
In particular, the Rl = slice oflZ(L, k, H) is identical to 1Z(L — 1, k — 1, H), i.e., 

{(R h R L -i) :(#!,..., R L -i, 0) 6 Tl{L, k, H)} = K(L - 1, k - 1, H). (10) 

Proof. Property 1 follows directly from the definition of characteristic cone [TTl Lec. 2]. Prop- 
erty 2 is due to the fact that 

(R u ...,R L ) = (H/k,...,H/k) (11) 

is a solution to the equations 

Y,Ri = H, VDefif. (12) 

l&D 

To see property 3, note that the Ri = slice of TZ(L, k, H) is given by all nonnegative rate 
tuples (Ri,..., Ri-!, Ri+i, R L ) satisfying 



Y,R d >H, e u ng> {0 (13) 



where fi^L denotes all subsets of Vt L \ {/} of size k. Since every inequality with D £ ^l\U} 

is dominated by every inequality with D' £ ^L\m anc l sucn ^ na ^ F)' C D, we have the desired 
property. □ 

Fig. [1] illustrates the above polyhedral structure of TZ(L, k, H) for L = 2 and 3. The 
following corollary summarizes the minimum sum rate for the general S-SSDC problem. 

Corollary 2. For any (L,N,m) S-SSDC problem, the minimum sum rate 

Rms = —^—H(S). (14) 
m — JM 



Proof. Let us first verify that 



L 



min \^Ri = -H. (15) 

(R u ...,R L )e1l(L,k,H) t-^ k 

For any rate tuple (Ri, . . . , Rl) £ 7Z(L, k, H), we have 

J2 R i> H , V£>en[ fc) . (16) 
leD 




Figure 1: Illustration of the rate region lZ(L,k,H) for L = 2 and 3. The Ri = slices of 
7^.(3, 1, H) are empty sets, and the Ri = slices of 7^(3, 2, H) are isomorphic to i?(2, 1,H). 

(k) 

Summing over all D G £l L gives 



We thus have 

L 



L 



y fc — 1 

for any rate tuple (R\, . . . , i?^) G 7£(L, k, H). On the other hand, note that the symmetrical 
rate tuple 

(H/k, . . . , H/k) G K{L, k, H) (19) 

so 

L L 

min ^ Ri< -rH. (20) 

(Ri,...,R L )£ll{L,k,H)^-^ k 

Combining (ITS]) and (1211 completes the proof of ffTo^) . 
Now by Theorem [TJ 

l L L 
R ms = min V Ri = min V flj = tt#(S). (21) 

{Ri,...,R L )&nj-^ (R 1 ,...,R L )eTZ(L,m-N,H(S)) 777, - iV 

This completes the proof of the corollary. □ 
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Figure 2: Illustration of the (L, N,m, (R u . . . , R L )) WN. 
2.3 Proof of Theorem [J 

Let us first show that the symmetrical rate tuple (H(S) / (m — N), . . . , H(S) / (m — N)) is admis- 
sible by considering the following simple source-channel separation scheme for the (L, N, m) 
S-SSDC problem: 

• First compress the source sequence S n into a source message W using a fixed-length 
lossless source code. It is well known [121 Ch. 3.2] that the rate R of the source message 
W can be made arbitrarily close to the entropy rate H(S) for sufficiently large block 
length n. 

• Next, the source message W is delivered to the legitimate receiver using a secure 

(L,N,m,(R u ...,R L )) 

WN code. 

The problem of secure coding over a WN was formally introduced in [8]. A generic WN 
(Q,s,U,A) consists of a directed acyclic network Q, a source node s, a set of user nodes U, 
and a collection of sets of wiretapped edges A. Each member of A may be fully accessed by 
an eavesdropper, but no eavesdropper may access more than one member of A. The source 
node has access to a message W, which is intended for all user nodes in U but needs to be 
kept perfectly secret from the eavesdroppers. The maximum achievable secrecy rate for W is 
called the secrecy capacity of the WN and is denoted by C S (G, s,U,A). 

An (L, N, m, (Ri, . . . , Rl)) WN is a special WN with three layers of nodes: top, middle, 
and bottom. As illustrated in Fig. |2l the only node in the top layer is the source node s. There 
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are L intermediate nodes in the middle layer, each corresponding to an encoder in the (L, N, m) 
S-SSDC problem. For each I = 1, . . . , L, the source node s is connected to the intermediate 
node / by a channel (s, I) with capacity Ri. There are 

|W| = ( L m ) (22) 

user nodes in the bottom layer, each corresponding to a possible realization of the legitimate 
receiver in the (L,N,m) S-SSDC problem and is connected to m intermediate nodes through 
m infinite-capacity channels. Finally, the collection of sets of wiretapped edges A is defined as 

A:={{(s,l)\leA}: Aetl^l (23) 

where each set of wiretapped edges in A corresponds to a possible realization of the eavesdrop- 
per in the (L, N, m) S-SSDC problem. 

Based on the aforementioned connection between the (L, N, m) S-SSDC problem and the 
problem of secure coding over the (L, N, m, (R\, . . . , Rl)) WN, we have the following simple 
lemma. 

Lemma 2. A nonnegative rate tuple (Rx,...,Rl) is admissible for the (L,N,m) S-SSDC 
problem if the entropy rate of the source is less than or equal to the secrecy capacity of the 
(L,N,m,(R 1 ,...,R L )) WN, i.e., 

H(S)<C s (L,N,m,(R l ,...,R L )). (24) 

In general, characterizing the exact secrecy capacity of a WN can be very difficult. For a 
generic WN (Q,s,U,A), the following secrecy rate 

R s = min [mincut(s, u) — mincut(s, A)] (25) 

is known [8] to be achievable. Here, mincut(s, u) denotes the value of a minimum cut between 
the source node s and the user node u, and mincut(s, A) denotes the value of a minimum cut 
between the source node s and the set of wiretapped edges A. For the (L, N, m, (H(S)/(m — 
N), . . . , H(S)/(m — N))) WN, it is straightforward to verify that 



and 



Hence, the secrecy rate 



771 

7nincut(s,u) = —H(S), Vn £ IA (26) 

m — N 



N 

mincut(s,A) = zzH(S), VA £ A. 27 

m — N 



R s = -JH—H(S) ^H(S) = H(S) (21 

m-N v ; m-N v ; v ; v 



is achievable for the (L, N, m, (H(S) j(m — N), . . . , H(S) j(m — N))) WN. We summarize this 
result in the following lemma. 
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Lemma 3. For any (L,N,m,(H(S)/(m — N), . . . ,H(S)/(m — N))) WN, the secrecy capacity 
can be bounded from below as 



C S (L, N, m, (H(S)/(m -N),..., H(S)/(m - N))) > H(S). 



(29) 



Combining Lemmas [2] and [3] proves the admissibility of the symmetrical rate tuple 



(H(S)/(m-N),...,H(S)/(m-N)). 



Building on the previous result, next we show that every rate tuple in TZ(L, m — N, H(S)) 
is admissible. By Proposition [TJ lZ(L,m — N,H(S)) is a pointed polyhedron with the char- 
acteristic cone given by {(Ri, ■ ■ ■ , Rl) '■ Ri > 0, V/ = 1, . . . , L}. Thus, to show that all rate 
tuples in lZ(L,m — N,H(S)) are admissible, it is sufficient to show that all corner points of 
1Z(L, m — N, H(S)) are admissible. 

We shall consider proof by induction, where the induction is on the total number of encoders 
L. First consider the base case with L = 2. When L = 2, there is only one nontrivial (L, N, m) 
S-SSDC problem: the (2, 1,2) S-SSDC problem. Note that the rate region 11(2, 1,H(S)) has 
only one corner point: the symmetrical rate pair (H(S), H(S)), whose admissibility has already 
been established. We thus conclude that every rate tuple in 71(2, 1, H(S)) is admissible for the 
(2,1,2) S-SSDC problem. 

Now assume that for every nontrivial (L — l,N',m') S-SSDC problem, all rate tuples in 
lZ(L',m' — N',H(S)) are admissible. Based on this assumption, next we show that all corner 
points of lZ(L,m — N,H(S)) are admissible for the (L,N,m) S-SSDC problem. We shall 
consider the corner points with all strictly positive entries (it they exist) and those with at 
least one zero entry separately: 

1) By Proposition [1], the symmetrical rate tuple (H(S)/(m — N), . . . , H(S)/(m — N)) is the 
only corner point of lZ(L,m — N,H(S)) with all strictly positive entries (if it exists), 
whose admissibility has already been established. 

2) To prove the admissibility of the corner points of TZ(L, m—N, H(S)) with at least one zero 
entry, by the symmetry of the rate region TZ(L, m—N, H(S)) we may consider without loss 
of generality those with R^ = 0. Note that if an (n, (Mi, . . . , Mx_i)) code satisfies both 
asymptotically perfect reconstruction constraint (TJ]) and the perfect secrecy constraint ([5]) 
for the (L — 1, N, m— 1) S-SSDC problem, an (n, (Mi, . . . , Mx_i, 1)) code with the same 
encoding functions for encoders 1 to L — 1 (encoder L uses a constant encoding function) 
also satisfies (trivially) both constraints for the (L, N, m) S-SSDC problem. Thus, if 
(Ri, . . . ,Rl-\) is an admissible rate tuple for the (L — l,N,m — 1) S-SSDC problem, 
then (Ri, . . . , Rl-i, 0) is also an admissible rate tuple for the (L, N, m) S-SSDC problem. 
By the induction assumption, all rate tuples in 1Z(L — l,m — N — l, H(S)) are admissible 
for the (L — 1, N, m — 1) problem. Combined with Proposition [T], this implies that all 
rate tuples in 



{(Ri,...,R L _i,0) : (Ri,...,R L _i)eTZ(L-l,m-N-l,H(S))} 

= {(R U ...,R L ) e 7l(L, m-N, H(S)) : R L = 0} 



(30) 
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i.e., the Rl = slice of lZ(L,m — N,H(S)), are admissible for the (L,N,m) S-SSDC 
problem. As a special case, all corner points of lZ(L,m — N,H(S)) with Rl = are 
admissible for the (L, N, m) S-SSDC problem. 

Combining Steps 1 and 2 proves that all corner points of TZ(L, m — N, H(S)) are admissible. 
We thus conclude that all rate tuples in 1Z(L,m — N,H(S)) are admissible. This completes 
the induction step and hence the proof of the theorem. 

3 Secure Symmetrical Multilevel Diversity Coding 
3.1 Problem Statement 

Let {Si [t], . . . , S^Arft]}^! be a collection of L — N independent discrete memoryless sources 
with time index t and let S£ := (Sfc[l], . . . , Sfc[n]). An (L,N) S-SMDC problem consists 
of a set of L encoders, a legitimate receiver who has access to a subset U of the encoder 
outputs, and an eavesdropper who has access to a subset A of the encoder outputs. Which 
subsets of the encoder outputs are available at the legitimate receiver and the eavesdropper 
are unknown a priori at the encoders. However, no matter which subsets U and A actually 
occur, the legitimate receiver must be able to asymptotically perfectly reconstruct the sources 
(Si, . . . , Sfc) whenever \U\ = N + k, and all sources (Si, . . . , Sl-n) must be kept perfectly 
secure from the eavesdropper as long as \A\ < N. 

Formally, an (n, (Mi, . . . , Ml)) code is defined by a collection of L encoding functions 

L-N 

e r . J]S fe n x/C^{l,...,M}, V/ = 1,...,L (31) 

k=l 

and decoding functions 

\U\-N 

du : J]{1, ...,M,}-> J] S[\ W C n L s.t. \U\ > N + 1. (32) 

l£U k=l 

Here, K is the key space accessible to all L encoders. A nonnegative rate tuple (i?i, . . . , Rl) 
is said to be admissible if for every e > 0, there exits, for sufficiently large block length n, an 
(n, (Mi, . . . , Ml)) code such that: 

• (Rate constraints) 

-logM<^ + e, V/ = 1,...,L; (33) 
n 

• (Asymptotically perfect reconstruction at the legitimate receiver) 

PT{du(Xu) (Sr, . . . , S? m _ N )} < e, VU C Q L s.t. \U\ > N + 1 (34) 

where Xi := e/((S", . . . , S£_ Ar ), if) is the output of the Zth encoder, K is the secret key 
shared by all L encoders, and Xy := \X\ : / G t/}; and 
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• (Perfect secrecy at the eavesdropper) 

H(S?, Sl_ N \X A ) = H(S?, S2_ N ), \/A C Q L s.t. \A\ < N (35) 

i.e., observing the encoder outputs X A does not provide any information regarding to 
the source sequences (S™, . . . , S2_ N ). 

3.2 Main Results 

Motivated by the success of [2H1] on the classical SMDC problem without any secrecy con- 
straints, here we focus on superposition coding where the output of the /th encoder Xi is given 
by 

X l= (xf\...,xr N) ) (36) 

and Xi is the coded message for source Sk at the /th encoder using an (L, N, N + k) S- 
SSDC code. Note here that all sources are encoded separately at the encoders, and there is 
no coding across different sources. Thus, if (R[ k) , . . . , R L k) ) is an admissible rate tuple for the 
(L, N,N + k) S-SSDC problem with source Sk, then the rate tuple 

(L-N L-N \ 

J]M fe \...,^4 fe) (37) 
fc=i k=i J 

is admissible for the (L, N) S-SMDC problem. 

By Corollary [2], the minimum sum rate for the (L, N,N + k) S-SSDC problem with source 
t>k is given by (L/k)H(S k ). It follows that YjZi (L/k)H(S k ) is the minimum sum rate that 
can be achieved by superposition coding for the (L, N) S-SMDC problem. The main result of 
this section is that Ylt=i ' (L/k)H(Sk) is in fact the minimum sum rate that can be achieved 
by any coding scheme for the (L, N) S-SMDC problem. Thus, superposition coding is optimal 
in terms of achieving the minimum sum rate for the general S-SMDC problem. We summarize 
this result in the following theorem. 

Theorem 3. Superposition coding can achieve the minimum sum rate for the general (L, N) 
S-SMDC problem, which is given by 

L-N , 
fe=l 

A proof of the theorem is provided in Sec 13.31 The proof uses an induction argument and is 
built on the classical subset inequality of Han [T2J Ch. 17.6] and the following key proposition. 

Proposition 2. For any (n, (Mi, . . . , Ml)) code that satisfies both asymptotically perfect re- 
construction constraint (|34|) and the perfect secrecy constraint (1331) . we have 

H(X D \S?, S%_ 17 X A ) > nH(Sk) + H(X D \S?, S%, X A ) - n5 k (n, e) (39) 



where 



5 k (n,e) := 1/n + e J]log |<S a | (40) 



a=l 



for any A G and D G such that A D D = and any k — 1, . . . , L — N. 
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3.3 Proof of the Main Results 

Let us first prove Proposition [2j Since \A\ = N, \D\ = k, and A n D = 0, we have \D U A\ = 
N + k. For any (n, (Mi, . . . , Ml)) code that satisfies both asymptotically perfect reconstruction 
constraint (1341) and the perfect secrecy constraint ( 1351) . we have by Fano's inequality 



ff(S?,...,S£|X D ,X A ) <ra$ fc (ra,e) (41) 

and 

if . . . , S£|X A ) = H(S?, . . . , S%). (42) 

Thus, 

H(X D \S?, S^, X A ) + n<5 fc (n, e) 

> H(X D \S™, . . . , S^i,X A ) + H(S™, . . . , S^\X D} X A ) (43) 

> H(X D \S±, . . . , S^i,X A ) + H(S%\Si , . . . ,£!%_!, X d ,Xa) (44) 
= H(X D ,S%\Si,...,S%_ 1 ,X A ) (45) 
= H(S%\S?,...,S%_ 1 ,X A )+H(X D \S?,...,SIX A ) (46) 
= if (S?, • • • , S%\X A ) - H(S?, S^\X A ) + H{X D \S?, S%, X A ) (47) 
= H(Si, . . . , S%) - H(S™, . . . , S^i\X A ) + H(X D \S™, . . . , S%,Xa) (48) 

> if (S* ...,££)- if (£?, • • • , S^O + if (X D |S?, • • • , S£, X A ) (49) 

— -^('S'fc l^f; • • • > Sfc-l) + ifpO?|<Sl\ • • • 5 S%,Xa) (50) 

= i?(^)+F(X D |^,...,^,X A ) (51) 
= nF(5' fc ) + F(X D |5'] l ,...,^,X A ) (52) 

where (I43p follows from ( l4Tj) . ( 148]) follows from (|42|) . (149]) follows from the fact that conditioning 
reduces entropy, (j5T1) follows from the fact that the sources Si, . . . , Sk are mutually indepen- 
dent, and ( 1521) follows from the fact that the source Sk is memoryless. Moving nSk{n,e) to the 
right-hand side of the inequality completes the proof of Proposition [2j 

Building on the result of Proposition |2l next let us show that for any (n, (Mi, . . . , Ml)) 
code that satisfies both asymptotically perfect reconstruction constraint ( 134"|) and the perfect 
secrecy constraint (1351) and any a — 1, . . . , L — N, we have 

L a j a 

Z=l fc=l fc=l 



where 



A ^ H{X D \S™, . . .,S™,Xa) ,_ .s 

^ a '~ / r. \ / r. _ at \ 2^ 2^ n ■ ^ 



L \ / L-iV \ , *-f a 



We shall consider proof by induction, where the induction is on a. First consider the base 
case with a — 1. Let A £ and let I £ fii\A Applying Proposition [2] with k = 1, we have 

if(X z ) > nF(S'i) +if(X^ 1 n ,X A ) -n$i(n,e). (55) 
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Averaging (j55p over all I G Ql \ A and all A G , we have 



1 



l l _ n ^ £ £ > n£T(Si) + ±Ax - nSfae). (56) 

iV ) I 1 



Aen W ien L \A 



Note that 



£ £ #(X0 = yX>TO- (57) 



£ ) ( L 1 N ) ief ^ L 1=1 

We thus have 

J2 H ( X i) >nLH(S 1 ) + A 1 -nL6 1 (n,e) (58) 

which completes the proof of the base case. 

Now assume that flSUl) holds for a — 1 for some 2 < a < L — N. Based on this assumption, 
next we show that (1531) also holds for a. By the classical subset inequality of Han [121 Ch. 17.6], 
for any A G we have 

1 ^ H{X \S?,...,S5_ lt XA) 



L-N \ ^ D ^L\~A 

a — 1 



> 1 H(X D |5p,...,Sg_ 1 ,^ A ) 

- ' L-N \ ^ D ^L\A 



(59) 



a 



It follows that 



/ F \ / t.-n\ 2^ 2^ ™ • (60) 



n){ a ) D ^ 
By Proposition [2J for any A G and any D G we have 

H(X D \S?, . . . , S^-i, X A ) > nH(S a ) + if (X^S?, . . . , S£, X A ) - n5 n (a, e). (61) 
Substituting ( IBTj) into (|60|) gives 

> 7 ^ r V V 

a 1 — / r. \ / t. at \ ^ £—1 



L \ L-N 

N j [ a j yit " L ^"l\a 



nH{S a ) + H(X D \S?, . . • , S2, X A ) - n5 a (n, e) 
(62) 

A a -nL<J a (n,e). (63) 
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By the induction assumption, 



L a—1 T a—1 

J2 H ( X i) > ^y^ + A^-^nWfc^e) (64) 

1=1 k=l k=l 

> ^^H(S k )+(^H(S a ) + A a -nL5 a (n,e)]-J2nL5 k (n,e) (65) 

k=l ^ ' k=l 

a T a 

= £^H(S k ) + A a -'irnL6 k {n,e). (66) 

k=l k=l 

This completes the proof of the induction step and hence (j55|) . 

Finally, let a = L — N in (15 3p . For any admissible rate tuple (R\, . . . , Rl) and any e > 0, 
we have 



L 

= 1 1=1 



5> z+e ) > J2 H ^ ( 6T ) 



L-N j L-N 

> J2T H ^ Sk) + AL - N ~^ nL5k ^ n,e) (68) 

k=l k=l 
L-N j L-N 

> (69) 
k=l k=l 

where fIBTJl) follows from the fact that A^_jv > 0. Divide both sides of fl6"9~l) by n and let n — > oo 
and e — > 0. Note that Sk{n, e) — > in the limit as n — > oo and e — > for all h — 1, . . . , L — N. 
We thus have 

L L-N j 

E^^Ei^) ( 7 °) 



fc=i 



for any admissible rate tuple (i?i, . . . , Rl)- This completes the proof of Theorem [3j 



4 Concluding Remarks 

This paper considered the problem of S-SMDC, which is a natural (perhaps also the simplest) 
extension of the classical SMDC problem [IH1] to the secrecy communication setting. First, 
the problem of encoding individual sources, i.e., the S-SSDC problem, was studied. A precise 
characterization of the entire admissible rate region was established via a connection to the 
problem of secure coding over a three-layer WN [8] and utilizing some basic polyhedral structure 
of the admissible rate region. Building on this result, it was then shown that the simple coding 
strategy of separately encoding individual sources at the encoders (superposition coding) can 
achieve the minimum sum rate for the general S-SMDC problem. 

Based on the result of Theorem [3] (and the fact that superposition coding can achieve the 
entire admissible rate region for the classical SMDC problems without secrecy constraints), it is 
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very tempting to conjecture that superposition coding can in fact achieve the entire admissible 
rate region for the general S-SMDC problem. In Appendix [Bj we verify that this is indeed the 
case for the simplest nontrivial S-SMDC problem: the (3, 1) S-SMDC problem. Our proof relies 
on an explicit characterization of the superposition coding rate region via a Fourier-Motzkin 
elimination procedure. The optimality of superposition coding is then established by carefully 
using the results of Proposition [2j 

Extending such a proof strategy to the general (L, N) S-SMDC problem, however, faces a 
number of challenges. To begin with, the complexity of Fourier-Motzkin elimination procedure 
grows unboundedly as the total number of encoders L increases. Thus, establishing an explicit 
characterization of the superposition coding rate region for the general (L, N) S-SMDC problem 
appears to be very difficult. An alternative strategy is to look for an implicit characterization 
of the superposition coding rate region using optimal a -resolutions, similar to that [I] for the 
classical SMDC problem without any secrecy constraints. In fact, note from Theorem [1] that the 
admissible rate region of an (L, N, m) S-SSDC problem depends on the parameters N and m 
only via its difference m — N. As mentioned previously in Sec. [21 when N = 0, the (L, N, m) S- 
SSDC problem reduces to the classical (L, m) SSDC problem without any secrecy constraints. 
Thus, the admissible rate region of the (L, N,N + k) S-SSDC problem with source is 
identical to that of the classical (L, k) SSDC problem with the same source. As a result, the 
superposition coding rate region of the (L, N) S-SMDC problem with sources (Si, . . . , Sl-n) 
is identical to the superposition coding rate region of the classical SMDC problem with a total 
of L encoders and sources (Si, . . . , Sl) where the entropy rate of the source H(S{) = for 
I = L — N + 1, . . . , L. Based on this observation, the a-resolution characterization of the 
superposition coding rate region for the general SMDC problem can be directly translated to 
the S-SMDC problem. It remains to see whether the properties provided in [I] on optimal a- 
resolutions are sufficient for establishing the optimality of superposition coding for the general 
S-SMDC problem. This problem is currently under our investigations. 

A Proof of Lemma U 

Let D e Vt^~ N) and let A e fi^ . Since ADD = 0, we have \DUA\ = N + (m-N)=m. For 
any (n, (Mi, . . . , Ml)) code that satisfies both asymptotically perfect reconstruction constraint 
(jlj) and the perfect secrecy constraint (jSJ), we have by Fano's inequality 



H(S n \X D ,X A ) < n5(n,e 



(71) 



where 



S(n, e) = 1/n + elog |«S 



(72) 



and 



H(S n \X A ) = H(S n ). 



(73) 
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For any admissible rate tuple (R±, . . . , R£) and any e > 0, we have 



> (75) 

> H{X D \X A ) (76) 

> H(X D \X A )+H(S n \X D ,X A )-n5(n,e) (77) 
= #(X D ,S n LY A )-7i5(7i,e) (78) 
= H(S n \X A ) +H(X D \S n ,X A ) -n5(n,e) (79) 

> H(S n \X A ) -n5(n,e) (80) 
= H{S n ) -n5{n,e) (81) 
= nH(S)-n5(n,e) (82) 



where (1751) follows from the independence bound on entropy, (1761) follows from the fact that 
conditioning reduces entropy, (1771) follows from (ITT]) . (}8"Tj) follows from (1731) . and (1821) follows 
from the fact that the source S is memoryless. Divide both sides of (JS2J) by n and let n — > oo 
and e — ?• 0. Note that 5(n, e) — > in the limit as n — >• oo and e — > 0. We have from (182 p that 

J2Ri>H(S), \/Dedi~ N) . (83) 
This completes the proof of Lemma [TJ 



B The Admissible Rate Region of the (3, 1) S-SMDC 
Problem 

In this appendix, we show that superposition coding can achieve the entire admissible rate 
region for the (3, 1) S-SMDC problem (the simplest nontrivial S-SMDC problem). The result 
is summarized in the following theorem. 

Theorem 4. Superposition coding can achieve the entire admissible rate region for the (3, 1) 
S-SMDC problem, which is given by the collection of all rate triples (R±, R 2 , R3) satisfying 

Ri > HiSt) 

R2 > HiS,) 

R3 > H(Si) , , 

R, + R 2 > 2H(S 1 )+H(S 2 ) {QV 

R2 + R3 > 2H(S 1 ) + H(S 2 ) 

R 3 + R! > 2H(S 1 ) + H(S 2 ). 

Proof. Achiev ability. Consider the superposition coding scheme that separately encodes the 
sources S\ and S2 using the (3,1,2) and (3,1,3) S-SSDC codes, respectively. By Theo- 
rem [H the admissible rate region for the (3, 1,2) S-SSDC problem is given by all rate triples 



16 



(R?,R^,R?) satisfying 

R? > H(S 1 ) 

R? > H(S!) (85) 

and the admissible rate region for the (3,1,3) S-SSDC problem is given by all rate triples 
(Rf\Rf\Rf ] ) satisfying 



i?S 2) > 

R { 2 2) > 

Rf } > 

i?i 2) + 4 2) > h(s 2 ) 

4 2) +4 2) > H(S 2 ) 



(86) 



R^+R? > H{S 2 ). 
Following (I3TI) . all rate triples (i?i, R 2 , R3) as given by 

Ri = R[ 1] + Rf\ V/ = 1, 2, 3 (87) 

are admissible via superposition coding. Using Fourier- Motzkin elimination to eliminate R\ , 
/ = 1, 2, 3 and k = 1,2, from (!85l) - (l87j) . we obtain the explicit characterization of the superpo- 
sition coding rate region for the (3, 1) S-SMDC problem as expressed by (1841) . 

The converse. Next, we establish the optimality of superposition coding by proving that 
every inequality in (1841) must hold for all admissible rate triples (Ri, R 2 , R3) for the (3,1) 
S-SMDC problem. Let 

a + b, ifa + 6<3 , , 

a + b — 3, otherwise. 

For any admissible rate triple (i?i, R 2 , R3), any 1 — 1, 2, 3, and any e > 0, we have 



b :-- 



and 



n{Ri + e) > H{Xi) (89) 

> H(X t \X lm ) (90) 

> nH(S 1 ) + H(X l \S[\X lm )-n5 1 (n,e) (91) 

> nH (S x ) - nSx(n, e) (92) 



n{R l + R lm + 2e) 

> if (X, el ) (93) 

> if(^|X tel ) + if(X tel |X Jffi2 ) (94) 

> 2n#(S , 1 ) + ff(X^,X iffil ) + H(X lel \S?,X m ) - 2n* 1 (n,e) (95) 

> 2nH(S 1 ) + HiX^X^Xw) + ff(X Z ei|-5r,X te2 ) - 2nS 1 (n,e) (96) 
= 2r i if(5 1 ) + if(X,,X te i|5! l ,X te2 )-2n5 1 (n,e) (97) 

> 2nH(S 1 ) + (nH(S 2 ) + H{X h X l&1 \S^,S«,X lm ) - n5 2 (n,e)) - 2nd 1 (n,e) (98) 

> 2nH(S 1 )+nH(S 2 )-n5 2 (n,e)-2n5 1 (n,e). (99) 



17 



Here, (190]) . (|94|) and (j96l) follow from the fact that conditioning reduces entropy, and (}9Tj) . (|95|) 
and (1981) follow from Proposition [2j Dividing both sides of (|92|) and (199]) by n and letting 
n — > oo and e — > complete the proof of the converse part of the theorem. □ 
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